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INHERENT  BIAS  IN  NONLINEAR  LEAST  SQUARES  ANALYSIS 
OF  FIRST-ORDER  KINETIC  DATA 


1 .  INTRODUCTION 

It  has  become  increasingly  popular  to  use  an  unweighted 
nonlinear  least  squares  analysis  to  evaluate  first-order  rate 
constants. The  typical  procedure  starts  with  a  data  set 
ti,yi  and  fits  it  to  a  function 

p^  =  a  +  be“^^  ( 1 ) 


where  p^^  is  the  predicted  (calculated)  value  of  the  measurement 
y^.  This  is  a  relatively  complex  problem  because  the  nonlinear 
fitting  routine  does  not  reduce  algebraically  (as  does  the  linear 
version)  so  that  various  values  of  the  constants  a,  b,  and  k  are 
tried,  and  a  minimum  variance  is  sought.  It  is  used  in  prefer¬ 
ence  to  its  linear  analog 


ln(yj^  -  a)  =  -kt  +  ln(b) 


(2) 


because  the  experimental  uncertainties  of  the  y^  values  do  not 
transform  linearly  into  In  space.  The  uncertainty  of  ln(yj^  -  a) 
increases  with  time  and  in  the  unweighted  procedure  introduces  a 
large  potential  error  into  the  determination  of  k. 

The  shortcomings  of  the  linear  procedure  are  well 
known^''^"®  but  no  one  has  carefully  examined  the  difficulties 
inherent  in  the  exponential  fit  routine.  There  is  the  tacit 
assumption  that,  except  for  the  mathematical  gymnastics,  the 
unweighted  exponential  fit  routine  is  the  perfect  solution  to  the 
data  analysis.  Actually,  it  has  the  same  difficulties  as  the 
linear  case;  they  are  just  more  deeply  hidden. 

The  unweighted  least  squares  procedure  makes  uje  of 
several  assumptions.  It  assumes  that  all  of  the  errors  can  be 
associated  with  one  variable  (y)  and  that  errors  are  random 
(i.e.,  Gaussian).  Goodness  of  fit  is  then  determined  by 
minimizing  the  sum  of  (p^^  -  y^^)^,  where  Pj^  is  the  theoretical 
(calculated)  value  determined  by  substituting  into  a  trial 
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v:!vjt  ion  [i.e.,  pi  =  f(tj^)].  Practically  a  data  set  is 

observed  and  matched  to  a  calculated  data  set 

Each  term  in  the  least  squares  sum 


(Vi  -  Pi)2 


(3) 


IS  considered  to  be  equally  valid  because  each  yj^  is  equally 
valid  (i.e.,  has  the  same  uncertainty).  The  problem  arises  from 
the  first  assumption:  all  the  errors  can  be  lumped  into  y.  This 
implies  that  p^  is  exact  because  it  is  calculated  from  tj^  that  is 
assumed  to  be  exact.  Because  both  t  and  y  are  experimental 
quantities,  the  validity  of  that  assumption  needs  to  be  examined. 

UNCERTAINTY  ANALYSIS  FOR  LINEAR  FUNCTIONS 

For  linear  functions,  there  is  no  problem;  and  the 
.issumption  holds,  viz.. 


Pi  =  Cti 


If  each  t^  is  associated  with  an  uncertainty  u^  that  is  constant 
tor  all  then  the  uncertainty  of  Pj^  (Up)  is  given  by 


u. 


=  Cu^ 


and  is  constant  for  all  of  the  calculated  Pj^  values;  and  each 
term  of  the  least  squares  sum  is  equally  precise. 

3.  UNCERTAINTY  ANALYSIS  FOB  NONLINEAR  FUNCTIONS 

For  nonlinear  functions,  the  situation  is  more  complex, 
tor  the  exponential  case,  equation  3  becomes 


p^  =  b  exp(-kt^) 


(6) 
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If  the  uncertainty  of  each  tj^  constant,  then  the  largest 

value  that  the  calculated  can  assume  is 


(largest)  =  b  exp[-k(t^  -  u^)] 


(7) 


=  b  exp(-ktj^)  exp(ku^) 


and  the  smallest  is 


Pj^  (smallest)  =  b  exp[-k(tj^  +  u^)  ] 


(3) 


=  b  exp(-ktj^)  exp(-ku^) 


Considering  the  error  associated  with  p^^  to  be  half  the  length  of 
its  error  bar,  then 


2Up  =  Pj^  (largest)  -  Pj^  (smallest)  (0) 

=  b  exp(-ktj^)  exp(ku^)  -  b  exp(-ktj^)  exp(-ku^) 

=  b  exp(-ktj^)  [exp(ku^)  -  b  exp(-ku^)] 


Since  b,  k,  and  u^  are  constants,  the  bracketed  factor  is  also  a 
constant.  Using  A  to  represent  collected  constants,  the  net 
result  is 


Up  =  A  b  exp(-ktji^) 


(10) 


that,  when  compared  to  equation  6,  becomes 


u 


P 


=  A  P' 


(11) 
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So  rather  than  being  a  constant  as  is  implicit  in  the  normal 
application  of  the  least  squares  analysis,  Up  is  a  linear 
function  of  or  its  experimental  counterpart  yj^. 

Using  relative  values  to  relate  to  the  chemical  problem 
of  first-order  kinetics,  y^  varies  from  100-0%.  Thus,  points 
near  100%  have  twice  the  error  as  points  near  50%  and  10  times 
the  error  as  points  near  10%.  Because  the  validity  (i.e., 
appropriate  weight)  of  a  point  relates  to  the  inverse  square  of 
its  uncertainty  and  because  chemical  reactions  are  typically 
followed  for  five  half-lives  (i.e.,  from  100-3%),  this  represents 
a  serious  problem  for  any  type  of  unweighted  analysis. 

4 .  CONCLUSION 


The  problem  relates  to  the  change  in  slope  (dy/dt)  from 
a  high  initial  value  to  zero  at  the  completion  of  the  reaction. 
During  the  first  stages  of  the  reaction,  the  concentration  of  the 
reac+ant,  y,  changes  very  rapidly.  Any  uncertainty  of  the  time 
of  a  data  point  t  in  that  region  does  not  add  any  uncertainty  to 
the  y  value  of  the  point,  but  it  does  have  a  large  effect  on  the 
corresponding  p  value*  because  it  is  calculated  directly  from  the 
time  t.  At  later  stages  of  the  reaction  where  y  is  changing  less 
rapidly,  uncertainties  in  the  time  values  add  much  less 
ancertainty  to  the  calculated  value  of  p.  The  net  result  is  that 
the  uncertainty  of  the  t,y  point  is  not  constant  for  the  duration 
of  the  reaction.  No  data  analysis  can  be  considered  completely 
appropriate  unless  this  is  taken  into  account. 
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